Abstract: The term Big Data refers to collections of data sets so vast and complex that they cannot be handled with traditional data-processing techniques. Big Data is a paradigm that has emerged in recent years. We have reached an era of interconnectivity in which organizations must share data to make predictions and decisions in rapidly changing environments. The challenges include capture, storage, analysis, data curation, search, transfer, visualization, querying, updating, and information privacy. The trend toward larger data sets is driven by the additional information that can be derived from analyzing a single large set of related data, as compared with separate smaller sets containing the same total amount of data. Big Data usually involves data sets whose size exceeds the ability of commonly used software tools to capture, select, manage, and process the data within an acceptable time.
Keywords: Big Data, 3Vs, HDFS, MapReduce, Hive, Pig, HBase.